Search CORE

944 research outputs found

An End-to-End License Plate Localization and Recognition System

Author: Zhu Siyu
Publication venue: RIT Scholar Works
Publication date: 01/01/2015
Field of study

An end-to-end license plate recognition (LPR) system is proposed. It is composed of pre-processing, detection, segmentation and character recognition to find and recognize plates from camera based still images. The system utilizes connected component (CC) properties to quickly extract the license plate region. A novel two-stage CC filtering is utilized to address both shape and spatial relationship information to produce high precision and recall values for detection. Floating peak and valleys (FPV) of projection profiles are used to cut the license plates into individual characters. A turning function based method is proposed to recognize each character quickly and accurately. It is further accelerated using curvature histogram based support vector machine (SVM). The INFTY dataset is used to train the recognition system. And MediaLab license plate dataset is used for testing. The proposed system achieved 89.45% F-measure for detection and 87.33% accuracy for overall recognition rate which is comparable to current state-of-the-art systems

CiteSeerX

RIT Scholar Works

Recommended from our members

Particle Dynamics Simulation toward High-Shear Mixing Process in Many Particle Systems

Author: Zhu Siyu
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2018
Field of study

Granular materials appear in a broad range of industrial processes, including mineral processing, plastics manufacturing, ceramic component, pharmaceutical tablets and food products. Engineers and scientists are always seeking efficient tools that can characterize, predict, or simulate the effective material properties in a timely manner and with acceptable accuracy, such that the cost for design and develop novel composite granular materials could be reduced. The major scope of this dissertation covers the development, verification and validation of particle system simulations, including solid-liquid two-phase particle mixing process and foaming asphalt process. High shear mixing process is investigated in detail with different types of mixers. Besides particle mixing study, one liquid-gas two phase foaming asphalt simulation is studied to show the broad capacity of our particulate dynamics simulation scheme. Methodologies and numerical studies for different scenarios are presented, and acceleration plans to speed up the simulations are discussed in detail. The dissertation starts with the problem statement, which briefly demonstrates the background of the problem and introduces the numerical models built from the physical world. In this work, liquid-solid two-phase particle mixing process is mainly studied. These mixing processes are conducted in a sealed mixer and different types of particles are mixed with the rotation of the mixer blades, to obtain a homogeneous particle mixture. In addition to the solid-liquid particle mixing problem, foaming asphalt problem, which is a liquid-gas two phase flow problem is also investigated. Foaming asphalt is generated by injecting a small amount of liquid additive (usually water) to asphalt at a high temperature. The volume change during this asphalt foaming process is studied. Given the problem statement, detailed methodologies of particle dynamics simulation are illustrated. For solid-liquid particle mixing, Smoothed Particle Hydrodynamics (SPH) and Discrete Element Method (DEM) are introduced and implemented to simulate the dynamics of solid and liquid particles, respectively. Solid-liquid particle interactions are computed according to Darcy`s Law. Then the proposed SPH coupling DEM model is verified by three classical case studies. For foaming asphalt problems, a SPH numerical model for foaming asphalt simulation is proposed, and simulations with different water contents, pressures and temperatures are conducted and the results agree with the experiments well. The coupled SPH-DEM method is applied to the particle mixing process, and several particle mixing numerical studies are conducted and these simulations are analyzed in multiple aspects. For the solid-liquid particle mixing problem, liquid plays an important role in the mixing performance. The effects of liquid content and liquid viscosity on mixing performance are studied. The mixing indexes of the mixture are applied to analyze the mixing quality, and the differences between three kinds of mixing indexes are discussed. Then mixers commonly used in industry such as Double Planetary Mixer (DPM) are modeled in mixing simulation and their results are compared with the experiments. Similar to other numerical simulation problems, the scale of the model and the accuracy of the simulation results are constrained by the computational capacity. Our in-house software package Particle Dynamics Parallel Simulator(PDPS) has been used as a platform to implement the algorithms above and conduct the simulations. Two parallel computing methods of Message Passing Interface (MPI) parallel computing and Graphics Processing Unit (GPU) acceleration have been used to accelerate the simulations. Speedup results for both MPI parallel computing and GPU methods are illustrated in the case studies. In summary, a comprehensive approach for particle simulation is proposed and applied to particle mixing process and asphalt foaming simulation. The simulation results are analyzed in various aspects to provide valuable insights to the problems studied in this work. Given the improvement of computational capacity, particle dynamics in higher resolution and simulations in more complex configurations can be obtained. This particle simulation platform is general and it can be straightforwardly extended to many-particle systems with more particle phases and solid-liquid-gas dynamics problems

Columbia University Academic Commons

Text Detection in Natural Scenes and Technical Diagrams with Convolutional Feature Learning and Cascaded Classification

Author: Zhu Siyu
Publication venue: RIT Scholar Works
Publication date: 12/05/2016
Field of study

An enormous amount of digital images are being generated and stored every day. Understanding text in these images is an important challenge with large impacts for academic, industrial and domestic applications. Recent studies address the difficulty of separating text targets from noise and background, all of which vary greatly in natural scenes. To tackle this problem, we develop a text detection system to analyze and utilize visual information in a data driven, automatic and intelligent way. The proposed method incorporates features learned from data, including patch-based coarse-to-fine detection (Text-Conv), connected component extraction using region growing, and graph-based word segmentation (Word-Graph). Text-Conv is a sliding window-based detector, with convolution masks learned using the Convolutional k-means algorithm (Coates et. al, 2011). Unlike convolutional neural networks (CNNs), a single vector/layer of convolution mask responses are used to classify patches. An initial coarse detection considers both local and neighboring patch responses, followed by refinement using varying aspect ratios and rotations for a smaller local detection window. Different levels of visual detail from ground truth are utilized in each step, first using constraints on bounding box intersections, and then a combination of bounding box and pixel intersections. Combining masks from different Convolutional k-means initializations, e.g., seeded using random vectors and then support vectors improves performance. The Word-Graph algorithm uses contextual information to improve word segmentation and prune false character detections based on visual features and spatial context. Our system obtains pixel, character, and word detection f-measures of 93.14%, 90.26%, and 86.77% respectively for the ICDAR 2015 Robust Reading Focused Scene Text dataset, out-performing state-of-the-art systems, and producing highly accurate text detection masks at the pixel level. To investigate the utility of our feature learning approach for other image types, we perform tests on 8- bit greyscale USPTO patent drawing diagram images. An ensemble of Ada-Boost classifiers with different convolutional features (MetaBoost) is used to classify patches as text or background. The Tesseract OCR system is used to recognize characters in detected labels and enhance performance. With appropriate pre-processing and post-processing, f-measures of 82% for part label location, and 73% for valid part label locations and strings are obtained, which are the best obtained to-date for the USPTO patent diagram data set used in our experiments. To sum up, an intelligent refinement of convolutional k-means-based feature learning and novel automatic classification methods are proposed for text detection, which obtain state-of-the-art results without the need for strong prior knowledge. Different ground truth representations along with features including edges, color, shape and spatial relationships are used coherently to improve accuracy. Different variations of feature learning are explored, e.g. support vector-seeded clustering and MetaBoost, with results suggesting that increased diversity in learned features benefit convolution-based text detectors

RIT Scholar Works

Supersolid and pair correlations of the extended Jaynes-Cummings-Hubbard model on triangular lattices

Author: Greschner Sebastian
Guo Lijuan
Zhang Wanzhou
Zhu Siyu
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2019
Field of study

We study the extended Jaynes-Cummings-Hubbard model on triangular cavity lattices and zigzag ladders. By using density-matrix renormalization group methods, we observe various types of solids with different density patterns and find evidence for light supersolids, which exist in extended regions of the phase diagram of the zigzag ladder. Furthermore, we observe strong pair correlations in the supersolid phase due to the interplay between the atoms in the cavities and atom-photon interaction. By means of cluster mean-field simulations and a scaling of the cluster size extending our analysis to two-dimensional triangular lattices, we present evidence for the emergence of a light supersolid in this case also.Comment: 11 pages, 16 figure

arXiv.org e-Print Archive

Université de Genève: Yareta

Archive ouverte UNIGE

BCS-like disorder-driven instabilities and ultraviolet effects in nodal-line semimetals

Author: Syzranov Sergey
Zhu Siyu
Publication venue
Publication date: 01/05/2023
Field of study

We study the effects of quenched disorder on electrons in a 3D nodal-line semimetal. Disorder leads to significant renormalisations of the quasiparticle properties due to ultraviolet processes, i.e. processes of scattering in a large band of momenta, of the width exceeding the inverse mean free path. As a result, observables such as the density of states and conductivity exhibit singular behaviour in a broad range of disorder strengths, excluding a small vicinity of the singular point. We find that, for example, the density of quasiparticle states diverges as a function of the disorder strength

g

\rho(g,E)\propto |g_c(E)-g|^{-2}|E|

for

g

smaller than the critical value

g_c(E)

and crosses over to a constant for

g

very close to

g_c(E)

, where

E

is the quasiparticle energy. For certain disorder symmetries, a 3D disordered nodal-line semimetal can be mapped to a 2D metal with attractive interactions. The described disorder-driven instabilities in such a nodal-line semimetal are mapped to Cooper and exciton-condensation instabilities in a 2D metal. For other disorder symmetries, the respective instabilities are similar but not exactly dual. We discuss experimental conditions favourable for the observation of the described effects.Comment: 21 pages, 7 figure

arXiv.org e-Print Archive

Knowledge-Driven Distractor Generation for Cloze-style Multiple Choice Questions

Author: Ren Siyu
Zhu Kenny Q.
Publication venue
Publication date: 07/12/2020
Field of study

In this paper, we propose a novel configurable framework to automatically generate distractive choices for open-domain cloze-style multiple-choice questions, which incorporates a general-purpose knowledge base to effectively create a small distractor candidate set, and a feature-rich learning-to-rank model to select distractors that are both plausible and reliable. Experimental results on datasets across four domains show that our framework yields distractors that are more plausible and reliable than previous methods. This dataset can also be used as a benchmark for distractor generation in the future.Comment: To appear at AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications